智能论文笔记

Identification of Binary Neutron Star Mergers in Gravitational-Wave Data Using YOLO One-Shot Object Detection

João Aveiro , Felipe F. Freitas , Márcio Ferreira , Antonio Onofre , Constança Providência , Gonçalo Gonçalves , José A. Font

分类：计算机视觉 | 机器学习

2022-07-01

我们证明了Yolov5模型（一种基于通用卷积的单杆对象检测模型）的应用，在从当前生成干涉仪检测器的重力数据中检测到二进制中子星（BNS）聚合事件的任务。我们还基于用于模型训练，验证和测试步骤的大概波形模型对合成数据生成和准备任务的详尽说明。使用这种方法，我们实现平均平均精度（$ \ text {map} _ {[0.50]} $）的单个类验证数据集的值为0.945，测试数据集的平均值为0.945，高达0.978。此外，训练有素的模型成功地识别了LIGO H1检测器数据中的GW170817事件。 LIGO L1检测器数据也可以通过附加的预处理步骤进行识别，而无需在Inspiral的最后阶段消除大故障。 GW190425事件的检测不太成功，这证明了信噪比的性能退化。我们的研究表明，Yolov5模型是第一阶段检测警报管道的有趣方法，并且在整合到更复杂的管道中时，用于实时推断物理源参数。

translated by 谷歌翻译

Models under which random forests perform badly; consequences for applications

José A. Ferreira

分类： (统计)机器学习 | 机器学习

2019-10-02

我们举例说明数据生成模型的示例，其中Breiman的随机森林可能极慢地收敛到最佳预测器，甚至无法保持一致。为这些属性提供的证据是基于主要是直观的论点，类似于前面使用的那些具有更简单的示例以及数值实验。虽然可以始终选择随机森林表现得非常严重的模型，但我们表明基于“变量使用”和“变量重要性”统计的简单方法通常可用于构建基于“许多武装”的更好的预测因子通过强制初始拆分获得的随机森林，该变量是算法的默认版本倾向于忽略的变量。

translated by 谷歌翻译

Anomaly detection in laser-guided vehicles' batteries: a case study

Gianfranco Lombardo , Stefano Cagnoni , Stefano Cavalli , Juan José Contreras Gonzáles , Francesco Monica , Monica Mordonini , Michele Tomaiuolo

分类：机器学习

2022-12-27

Detecting anomalous data within time series is a very relevant task in pattern recognition and machine learning, with many possible applications that range from disease prevention in medicine, e.g., detecting early alterations of the health status before it can clearly be defined as "illness" up to monitoring industrial plants. Regarding this latter application, detecting anomalies in an industrial plant's status firstly prevents serious damages that would require a long interruption of the production process. Secondly, it permits optimal scheduling of maintenance interventions by limiting them to urgent situations. At the same time, they typically follow a fixed prudential schedule according to which components are substituted well before the end of their expected lifetime. This paper describes a case study regarding the monitoring of the status of Laser-guided Vehicles (LGVs) batteries, on which we worked as our contribution to project SUPER (Supercomputing Unified Platform, Emilia Romagna) aimed at establishing and demonstrating a regional High-Performance Computing platform that is going to represent the main Italian supercomputing environment for both computing power and data volume.

translated by 谷歌翻译

SegAugment: Maximizing the Utility of Speech Translation Data with Segmentation-based Augmentations

Ioannis Tsiamas , José A. R. Fonollosa , Marta R. Costa-jussà

分类：自然语言处理

2022-12-19

Data scarcity is one of the main issues with the end-to-end approach for Speech Translation, as compared to the cascaded one. Although most data resources for Speech Translation are originally document-level, they offer a sentence-level view, which can be directly used during training. But this sentence-level view is single and static, potentially limiting the utility of the data. Our proposed data augmentation method SegAugment challenges this idea and aims to increase data availability by providing multiple alternative sentence-level views of a dataset. Our method heavily relies on an Audio Segmentation system to re-segment the speech of each document, after which we obtain the target text with alignment methods. The Audio Segmentation system can be parameterized with different length constraints, thus giving us access to multiple and diverse sentence-level views for each document. Experiments in MuST-C show consistent gains across 8 language pairs, with an average increase of 2.2 BLEU points, and up to 4.7 BLEU for lower-resource scenarios in mTEDx. Additionally, we find that SegAugment is also applicable to purely sentence-level data, as in CoVoST, and that it enables Speech Translation models to completely close the gap between the gold and automatic segmentation at inference time.

translated by 谷歌翻译

Going for GOAL: A Resource for Grounded Football Commentaries

Alessandro Suglia , José Lopes , Emanuele Bastianelli , Andrea Vanzo , Shubham Agarwal , Malvina Nikandrou , Lu Yu , Ioannis Konstas , Verena Rieser

分类：计算机视觉 | 自然语言处理

2022-11-08

Recent video+language datasets cover domains where the interaction is highly structured, such as instructional videos, or where the interaction is scripted, such as TV shows. Both of these properties can lead to spurious cues to be exploited by models rather than learning to ground language. In this paper, we present GrOunded footbAlL commentaries (GOAL), a novel dataset of football (or `soccer') highlights videos with transcribed live commentaries in English. As the course of a game is unpredictable, so are commentaries, which makes them a unique resource to investigate dynamic language grounding. We also provide state-of-the-art baselines for the following tasks: frame reordering, moment retrieval, live commentary retrieval and play-by-play live commentary generation. Results show that SOTA models perform reasonably well in most tasks. We discuss the implications of these results and suggest new tasks for which GOAL can be used. Our codebase is available at: https://gitlab.com/grounded-sport-convai/goal-baselines.

translated by 谷歌翻译

scikit-fda: A Python Package for Functional Data Analysis

Carlos Ramos-Carreño , José Luis Torrecilla , Miguel Carbajo-Berrocal , Pablo Marcos , Alberto Suárez

分类：机器学习 | (统计)机器学习

2022-11-04

The library scikit-fda is a Python package for Functional Data Analysis (FDA). It provides a comprehensive set of tools for representation, preprocessing, and exploratory analysis of functional data. The library is built upon and integrated in Python's scientific ecosystem. In particular, it conforms to the scikit-learn application programming interface so as to take advantage of the functionality for machine learning provided by this package: pipelines, model selection, and hyperparameter tuning, among others. The scikit-fda package has been released as free and open-source software under a 3-Clause BSD license and is open to contributions from the FDA community. The library's extensive documentation includes step-by-step tutorials and detailed examples of use.

translated by 谷歌翻译

Safe Real-World Autonomous Driving by Learning to Predict and Plan with a Mixture of Experts

Stefano Pini , Christian S. Perone , Aayush Ahuja , Ana Sofia Rufino Ferreira , Moritz Niendorf , Sergey Zagoruyko

分类：机器人 | 机器学习

2022-11-03

The goal of autonomous vehicles is to navigate public roads safely and comfortably. To enforce safety, traditional planning approaches rely on handcrafted rules to generate trajectories. Machine learning-based systems, on the other hand, scale with data and are able to learn more complex behaviors. However, they often ignore that agents and self-driving vehicle trajectory distributions can be leveraged to improve safety. In this paper, we propose modeling a distribution over multiple future trajectories for both the self-driving vehicle and other road agents, using a unified neural network architecture for prediction and planning. During inference, we select the planning trajectory that minimizes a cost taking into account safety and the predicted probabilities. Our approach does not depend on any rule-based planners for trajectory generation or optimization, improves with more training data and is simple to implement. We extensively evaluate our method through a realistic simulator and show that the predicted trajectory distribution corresponds to different driving profiles. We also successfully deploy it on a self-driving vehicle on urban public roads, confirming that it drives safely without compromising comfort. The code for training and testing our model on a public prediction dataset and the video of the road test are available at https://woven.mobi/safepathnet

translated by 谷歌翻译

Rethinking Motion Deblurring Training: A Segmentation-Based Method for Simulating Non-Uniform Motion Blurred Images

Guillermo Carbajal , Patricia Vitoria , Pablo Musé , José Lezama

分类：计算机视觉

2022-09-26

成功培训端到端的深网进行真实运动去缩合，需要尖锐/模糊的图像对数据集，这些数据集现实且多样化，足以实现概括以实现真实的图像。获得此类数据集仍然是一项具有挑战性的任务。在本文中，我们首先回顾了现有的Deblurring基准数据集的局限性，从泛化到野外模糊图像的角度。其次，我们提出了一种有效的程序方法，以基于一个简单而有效的图像形成模型来生成清晰/模糊的图像对。这允许生成几乎无限的现实和多样化的培训对。我们通过在模拟对上训练现有的DeBlurring架构，并在四个真实模糊图像的标准数据集中对其进行评估，从而证明了所提出的数据集的有效性。我们观察到使用建议方法训练时动态场景的真实运动毛线照片的最终任务的出色概括性能。

translated by 谷歌翻译

Stochastic strategies for patrolling a terrain with a synchronized multi-robot system

Luis E. Caraballo , José M. Díaz-Báñez , Ruy Fabila-Monroy , Carlos Hidalgo-Toscan

分类：机器人

2022-09-14

可以部署一组合作的空中机器人，以有效地巡逻地形，每个机器人都会在指定区域飞行，并定期与邻居共享信息，以保护或监督它。为了确保鲁棒性，以前对这些同步系统的作品提出了将机器人发送到相邻区域的情况，以防它检测到故障。为了处理不可预测性并提高确定性巡逻计划的效率，本文提出了随机策略，以涵盖在代理之间分配的领域。首先，在本文中针对两个指标进行了对随机过程的理论研究：\ emph {闲置时间}，这是两个连续观察到地形的任何点和\ emph {隔离时间}之间的预期时间，预期的时间}，预期的时间机器人没有与任何其他机器人通信的时间。之后，将随机策略与添加另一个指标的确定性策略进行了比较：\ emph {广播时间}，从机器人发出消息的那一刻，直到团队的所有其他机器人收到消息。模拟表明，理论结果与模拟和随机策略的表现非常吻合，其行为与文献中提出的确定性协议获得的行为相比。

translated by 谷歌翻译

ExplORB-SLAM: Active Visual SLAM Exploiting the Pose-graph Topology

Julio A. Placed , Juan J. Gómez Rodríguez , Juan D. Tardós , José A. Castellanos

分类：机器人

2022-09-08

长期以来，部署能够探索未知环境的自动驾驶机器人一直是与机器人社区有很大相关性的话题。在这项工作中，我们通过展示一个开源的活动视觉猛烈框架来朝着这个方向迈出一步基础姿势图提供的结构。通过仔细估计后验加权姿势图，在线实现了D-最佳决策，目的是在发生探索时改善本地化和映射不确定性。

translated by 谷歌翻译